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Abstract 

Over the past decade quantum information theory has developed into a vigorous 
field of research despite the fact that quantum information, as a precise concept, is 
undefined. Indeed the very idea of viewing quantum states as carriers of some kind 
of information (albeit unknowable in classical terms), leads naturally to interesting 
questions that might otherwise never have been asked, and corresponding new insights. 
We will discuss some illustrative examples, including a strengthening of the well known 
no-cloning theorem leading to a property of permanence for quantum information, and 
considerations arising from information compression that reflect on fundamental issues. 

1 Introduction 

Perhaps the most intriguing product of quantum information theory is the concept of quan- 
tum information itself. In the early 1990s Charles Bennett was one of the first workers 
to recognise and promote this new concept, establishing the foundations of a new subject. 
Taken as a primary ingredient, quantum information cannot be defined, but the viewpoint it 
fosters is richly suggestive, leading to new interesting questions and modes of interpretation 
for some quantum processes. In this paper we will explore a few examples. 

A quantum state may be viewed as a carrier of information in two fundamentally 
different ways. Firstly \ifj) may be regarded as carrying the classical information of the 
state identity. As an example, a sender may prepare one of the two (non-orthogonal) states 
\ipo ) and \ipi) to encode the bit values and 1 respectively. Then the receiver's task is to 
regain the value of i from If Pj-u denotes the probability that he outputs j when the 
state was \tpi) and qi is the probability that \ipi) was sent, then he may, for example, apply 
a procedure that minimises the error probability p 2 | 1Q1 + P\\2Q2- In this way the available 
information in the quantum state is similar to the result of classical communication through 
a noisy channel and it is well known that if (V'llV^) 7^ then the minimum error probability 
cannot be zero i.e. the state \ipi) cannot be perfectly identified by any physical process. 

In a second way, may be viewed as the carrier of "quantum information" which 
although we leave undefined in more fundamental terms, we intuitively think of as "the state 
itself" . Quantum information is a new concept with no classical analogue and it is important 
to distinguish it from the state identity. For example, given a physical realisation of one of 
the two states above, quantum theory considerably restricts (in a richly structured way) 
the allowable manipulations that we can perform, in contrast to what is possible if we are 
given the identity of i. Indeed "being given the quantum state iV'i)" is very different from 
being given any kind of classical information and by an analogy of terminology we apply 
the name quantum information to describe what we have received. Stated more formally, 
we would aim to formulate and interpret quantum physics in a way that has a concept 
of information as a primary fundamental ingredient. Primary fundamental concepts are 



ipso facto undefined (as a definition amounts to a characterisation in yet more fundamental 
terms) and they acquire meaning only afterwards, from the structure of the theory they 
support. 

As a first example consider the process of quantum teleportation (cf for full details): 
Alice (A) succeeds in transferring a qubit state to Bob (B) (distantly separated in space) 
by sending only two classical bits of information. A and B also need to share an entangled 
(EPR) pair which is destroyed in the process. We would like to think of teleportation as the 
transmission of quantum information from A to B. If we accept the intuitively appealing 
tenet, that a transfer of information from sender to receiver must always be mediated by a 
channel connecting the two participants, then teleportation appears paradoxical - only two 
classical bits were sent so how did the full quantum information pass from A to B? Looking 
at the standard spacetime diagram of the teleportation process (cf figure 1(a) of P) we 
see that there is indeed a second (V shaped) path connecting A to B, which is defined 
by the two worldlines of the distributed EPR particle pair. This leads to an intriguing 
interpretation (first proposed by Bennett soon after the discovery of teleportation): in 
addition to the two bits, the remaining quantum information must have been propagated 
backwards in time from A to the EPR source and thence forwards in time to B 1 . Indeed if 
we insist that information transmission requires a physical channel then there appears to 
be no other possible interpretation of the teleportation process! It is remarkable that this 
interpretation is entirely consistent: the principles of quantum measurement theory imply 
that the information sent backwards in time is random and independent of the teleported 
state, so long as the two classical bits remain unknown. Hence the well known classical 
causal paradoxes of backwards in time information propagation are neatly circumvented. 
This analysis, inspired by our informational point of view, also reveals a new significance 
for entanglement in quantum theory (beyond the traditional issues of non-local correlations 
of measurement outcomes): entanglement can be viewed as providing a channel for the 
transmission of quantum information. 

In the following sections we will discuss two further issues in which an informational 
point of view leads to interesting considerations. Firstly we will revisit the quantum no- 
cloning theorem [I] and prove a new stronger form of this result. Together with the Pati- 
Braunstein no-deleting principle jHj this will lead to a property of "permanence" for quantum 
information. Secondly we will discuss the concept of information compression. In classical 
information theory this provides one of the clearest approaches to the concept of information. 
By mimicking this theory in a quantum context we obtain some surprising relationships 
between the concept of information and the geometry of Hilbert space (i.e. the basis of the 
conventional formulation of quantum theory) . 

X A similar interpretation involving backwards in time propagation was proposed earlier for the Bennett- 
Wiesner dense coding protocol in [2] and attributed there to Ben Schumacher. For further developments of 
this idea see also 
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2 A stronger no-cloning theorem 

It is well known that (non-orthogonal) pure quantum states cannot be cloned jlj i.e. if 
{IV'i)} i s a se t of pure states containing at least one non-orthogonal pair then no physical 
operation can achieve the transformation \tfji) — * Although the impossibility of 

cloning in quantum theory can be attributed to the fact that such a process is non-unitary 
or non-linear, from an informational point of view we can intuitively understand it by saying 
that two copies of a quantum state embody strictly more "information" than is available 
in just one copy, so cloning must be impossible. Extending this particular line of thought, 
it is then natural to go on to ask: what additional (quantum) information is needed to 
supplement one copy \ipi) in order to be able to produce two copies |V>i)? For classical 
information, no supplementary information at all is needed and one might guess that as 
the set {(V'i)} becomes "more classical" the necessary supplementary information should 
decrease in some suitable way. However we prove below that this is not the case: we show 
(for mutually non-orthogonal states) that the supplementary information must always be 
as large as it can possibly be i.e. the second copy \ipi) can always be generated from the 
supplementary information alone, independently of the first (given) copy. Thus in effect, 
cloning of is possible only if the second copy is fully provided as an additional input. 

We now give a precise formulation of the main result in this section. By a physical 
operation we will mean a trace preserving completely positive map. Note that this definition 
excludes the collapse of wavefunction in a quantum measurement, as a valid physical process. 
(This will be relevant to our later discussion of the no-deleting principle.) By an abuse of 
notation for pure states we will write \ip)(^\ <8> p as just \tp) £g> p and sometimes also omit 
the tensor product symbol, writing \ r ip){ij)\ (g> as IV') \' l P)- 



Theorem 1 Let be any finite set of pure states containing no orthogonal pairs of 

states. Let {pi} be any other set of (generally mixed) states indexed by the same labels. 
Then there is a physical operation 

\ipi) ® Pi — ► \ipi) 
if and only if there is a physical operation 

Pi — ► H>i) 

i.e. the full information of the clone must already be provided in the ancilla state pi alone. 

Remark 5 . If the set contains some orthogonal pairs then the unassisted clonability 

of orthogonal states spoils the simplicity of the statement of theorem 1. As an example 
consider 

= |0) |ai) = |o) 

1^2) = |1) \a 2 ) = I a) 

|^3> = ^([0) + |1» |a 3 > = |6) 

where \a) and \b) are orthogonal. Then clearly \-ipi) \a.i) — > is possible (as {\tpi) 

is an orthonormal set) but |a») — > \ipi) is not possible (as \a\) = \a,2) but l^i) 7^ IV^))- 
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Indeed the |aj) states here provide reliable distinguishability of i values exactly when this 
is not already provided by the |V>i)' s themselves. 

To prove the theorem we will use the following lemma which is proved as lemma 1 of 

EH- 

Lemma 1 Let {\cti}} and be two sets of pure states (indexed by the same labels). 

Then the two sets have equal matrices of inner products (i.e. {cti\ctj) = ((3i\f3j) for alii and 
j) if and only if the two sets are unitarily equivalent ( i. e. there exists a unitary operation 
U on the direct sum of the state spaces of the two sets with U \cti) = for all i). 
Proof of theorem^ Suppose that there is a physical operation pi — > Then clearly 
IV'i) ® Pi -> IV'i) IV'i) is allowed. 

Conversely suppose that there is a physical operation 

\i)i)®Pi — > IV*) IV'i) • (1) 

Consider first the case that pi are pure states, |aj) say. The physical operation eq. Q may 
be expressed as a unitary operation if we include an environment space, initially in a fixed 
state denoted \A). For clarity we include an extra register, initially in a fixed state |0), that 
is to receive the clone of IV'i)- Then eq. (^Q) may be written as a unitary transformation 

Mi) io> h) \a) — > m iVi) ico 

where |Cj) (generally depending on i) is the output state of the two registers that initially 
contained \a.j) \A). Hence by unitarity, the two sets {(^i) |c^}} an d |^«) have 
equal matrices of inner products and then, so do the sets an d |Ci)} (by a simple 

cancellation of {ipi\ipj) from the two initial matrices). Thus by lemma ^ these two sets 
are unitarily equivalent so \ipi) can be generated from \at) alone (by applying the unitary 
equivalence and discarding the |Cj) register). 

If pi are mixed we express them as probabilistic mixtures of pure states 

k 

(where all p[ are non-zero) . Then a physical operation achieves 

\lpi) (X) Pi ► l^i) |^i) for a11 * 

if and only if it achieves 

\ipi) ®\af) — >\A)\4>i) for all i and k. (2) 

By the pure state analysis above, a physical operation effecting eq. (J2J) exists if and only if 
there is a physical operation effecting 

v£ \ — ► IV'i) f° r an i an d k 

and then we get pi — > IV'i) too. QED. 
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In the particular case of cloning assisted by classical information (i.e. the states pi are 
required to be mutually commuting) we deduce that this supplementary data must contain 
the full identity of the states as classical information. Indeed if the pi are classical then 
they can be copied any number of times so if we can make one clone of from pi, we can 
make arbitrarily many clones and hence determine the the identity of i.e. the classical 
information of the label i must be contained in the supplementary classical information. 

The proof of theorem^is easily adapted to prove the following generalisation. Let {(V'i)} 
be any finite set of pure states containing no orthogonal pairs of states. Let \ipi)® n denote 
the state of n copies of Then there is a physical operation 

\^f n ® Pi — \^r (n+i) 

if and only if there is a physical operation 

Pi — ► \tpi) ■ 

Curiously, the increasing information contained in n copies of \ipi) (as n increases) can never 
be used to assist in the creation of even a single extra copy. 



3 No deleting 

Our techniques may also be used to give a simple proof of the Pati-Braunstein no-deleting 
principle [0] for sets {(V'i)} that contain no orthogonal pairs. The issue here is the following. 
Suppose we have two copies \tpi) \ipi) of a state and we wish to delete one copy by a physical 
operation: 

|&) — > |0> (3) 

(where |0) is any fixed state of the second register). As before, any such physical operation 
may be expressed as a unitary operation if we include an environment space, initially in a 
fixed state \A) say. Then eq. (jHJ) is equivalent to the unitary transformation 

|^) \A) \A) — |^) |0> \A) (4) 

where the final state \Ai) of the environment may depend on \ipi) in general. One way 
of achieving this is to simply swap (a constant) part of the environment into the second 
register but then the second copy of remains in existence (albeit in the environment 
now). The no-deleting principle states that the second copy of \tpi) can never be "deleted" in 
the sense that \tpi) can always be resurrected from \Aj). Note however, that if wavefunction 
collapse is also allowed as a valid physical process then deletion is possible. (We perform 
a complete measurement on and rotate the seen post-measurement state to |0) by a 
unitary transformation that depends on the measurement outcome.) 

To see the no-deleting principle with our methods, note that the unitarity of eq. © 
implies that the sets 1^.)} and {IV'i) |0) have equal matrices of inner products 

and then as before, so do the sets {(V'i)} and Thus lemmaHstates that these sets 

are unitarily equivalent, which is just the no-deleting principle. 
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4 Permanence of quantum information 



Deleting and cloning have a common feature: in cloning we saw that the existence of the first 
copy \tpi) provided no assistance in constructing the second copy from the supplementary 
information. Similarly for deletion, the existence of the first copy provides no assistance in 
deleting the second copy - in effect the only way to delete the second copy is to transform 
it out into the environment (i.e. |0) \Ai) in eq. @ is a unitary transform of \ipi) \A) alone) 
and this again makes no use of the first copy. Considering no-cloning and no-deleting 
together (and excluding wavefunction collapse as a valid physical process) we see that 
quantum information (of non-orthogonal states) has a quality of "permanence": creation 
of copies can only be achieved by importing the information from some other part of the 
world where it had already existed; destruction (deletion of a copy) can only be achieved 
by exporting the information out to some other part of the world where it must continue 
to exist. This property is different from the preservation of information by any reversible 
dynamics. For example the classical reversible C-NOT operation can imprint copies of a 
bit b into a standard state via \b) |0) — > \b) \b) and also delete copies via \b) \b) — > \b) |0) but 
in both cases the first copy is used in an essential way in the process and the information 
content of one copy is the same as that of two copies. In contrast, in the quantum (non- 
orthogonal) case, copying and deleting can only occur independently of the first copy and 
then reversibility of dynamics implies that the information of the second copy must have 
already separately existed in the environment (for cloning) or continue to exist separately 
in the environment (for deletion). But in any reasonable intuitive sense, \ipi) does 
not have double the information content of (and similarly \ipi)® n cannot have n times 
the information content, as n is unbounded). One might interpret this as an overlap of 
information content of the two copies and then theorem ^ implies that this common part 
cannot be duplicated from within a single copy and merely extended, to give the second 
copy. 



5 Information compression and Hilbert space geometry 

So far our discussion of quantum information has been qualitative. However it would be 
interesting to develop a quantitative theory of this concept, being able to say that one 
quantum system has more quantum information than another, and we would like to have 
corresponding dynamical laws for the manipulation of quantum information. In classical 
information theory there exists a well defined quantitative notion of information and as a 
first attempt we can consider importing it into a quantum context. 

In Shannon's classical information theory we begin with a classical information source 
which is defined by a prior probability distribution {p{\ of signals Sj. The information con- 
tent is then quantified by the Shannon entropy H{pi) = — J2iPi ^°&2Pi bits- This definition 
has a compelling physical interpretation: it characterises the minimal resources (H bits 
per signal) that are necessary and sufficient to faithfully represent the source (in a suitable 
asymptotic sense j7] that we need not elaborate here). To approach the concept of quantum 
information, a natural avenue is to mimic this very successful classical theory in a quan- 



6 



turn context. Thus we introduce a quantum source, characterised by a prior probability 
distribution {pi} of quantum signals |a$) and define its quantum information content to 
be S, the least number of qubits that are necessary and sufficient to faithfully represent 
the source (in an asymptotic sense that naturally generalises the classical case). Following 
Schumacher, one may prove [Hj that S is then the von Neumann entropy S(p) of the source 
density matrix p = J^iPi \ a i)( a i\i establishing a fundamental role for von Neumann entropy 
in quantum information theory. 

This notion of quantum information, while interesting, is perhaps not entirely satis- 
factory in that it still involves an essentially classical ingredient viz. the prior classical 
probability distribution. However in this context it should be pointed out that that there is 
an unexpected and remarkable harmony in such classical mixing of quantum information: if 
£\ = {\ati) ;pi} and £2 = {|/%) ; Qj} are two quantum sources with the same density matrix 
Y^iVi = J2j Qj then £\ and £2 are entirely indistinguishable by any physical 

process. The quantum information of the |aj)'s probabilistically mixed by pi is exactly the 
same as the quantum information of the \/3j)'s mixed by qj; no trace of the component 
states remains in the mixture! This indistinguishability can be seen to be related to various 
other consistency requirements of a physical theory such as the no superluminal signalling 
principle [SJ. 

A second difficulty with the proposal of identifying the quantum information content 
of a source with its von Neumann entropy S is the fact that very different sources can 
have the same entropy yet some look "more quantum" than others! This was realised soon 
after the appearance of Schumacher's compression theorem and discussed by Bennett and 
other participants during the first Elsag-Bailey quantum computing workshop at the ISI 
in Torino in July 1993. It was suggested that a quantum source might be decomposable 
into a classical and a quantum part, with the von Neumann entropy quantifying both parts 
together. Then we would seek to separate out a maximal classical part and quantify the 
quantum part alone [TU]. To illustrate the situation consider a source which emits one of 
two orthogonal states |t/>o) an d with equal prior probabilities of \. As the states can 
be reliably identified by a measurement, this source can be represented entirely in classical 
terms with S = 1 classical bit per signal. Suppose now that the states are not quite 
orthogonal e.g. KV'olV'i}! = 10 -9 . The von Neumann entropy is still very close to 1 and 
we ask: is this source "almost classical"? i.e. can we extract approximately 1 classical bit 
of information leaving behind a very small amount of quantum information (e.g. almost 
parallel states) in such a way that the signals can still be faithfully reconstructed from the 
classical and quantum information parts? This question was settled only recently jllj , in the 
negative: Let £ be a quantum source whose signal states do not lie in a family of orthogonal 
subspaces. Then £ can be faithfully compressed to a qubits per signal plus any number 
of classical bits per signal if and only if a is at least as large as the von Neumann entropy 
S i.e. it is generically impossible to separate a source non-trivially into a classical and a 
quantum part, and the classical representation of exactly orthogonal states is therefore a 
singular feature of infinite precision. 

Thus we need to look at more subtle properties of quantum compression to distinguish 
sources with equal entropy. Following a further suggestion of Bennett, we can study features 
of so-called visible quantum compression. In this scenario the source is described by giving 
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the classical information of the emitted signal state's identity (rather than the quantum 
state itself, as quantum information). Our task again is to faithfully represent the signal 
states with minimal resources and now we have more possibilities for coding. As an example 
consider a source of four signal states, with equal prior probabilities of \ and having von 
Neumann entropy 1. In visible compression we can represent this source with 2 classical 
bits per signal and no qubits (as there are four equiprobable possibilities) or with 1 qubit 
per signal and no classical bits (by creating the signal states and performing Schumacher's 
quantum compression on them). In between these two extremes there is a trade-off curve 
q(c): if we have c classical bits per signal (with < c < 2) then q{c) is the least number of 
qubits per signal that is sufficient to represent the source (so the above gives q(0) = 1 and 
q{2) = 0). Thus instead of trying to extract classical information from a quantum source, we 
start by giving a fully classical description of the source and consider the trade-off involved 
in coding the source back into quantum terms. An extensive study of this trade-off curve 
is given in ^2] and it is found that the shape of the curve does indeed distinguish different 
sources with the same entropy. 

Returning now to information compression and the insight it may provide into the 
notion of information, it is interesting to ask why compression is possible at all. Evidently 
some kind of redundancy in the raw signals is being eliminated. For a classical source it 
is well known that non-trivial compression is possible if and only if the prior probability 
distribution is not uniform. For example consider the case of two signals with unequal 
probabilities. In that case we already have some prior knowledge of the signal before it is 
received, in the sense that we can guess the signal (choosing the more probable one) and be 
correct more often than not. In this sense part of the signal (if sent in full) is redundant. 

The quantum situation is considerably more subtle. For any quantum source {\cti) ',Pt} 
we have S(p) < H(pi) with equality if and only if the signals are all mutually orthog- 
onal, suggesting that there is a quantum redundancy associated specifically with non- 
orthogonality. For example consider two qubit signals l^i) and [^2) at 45° with equal 
prior probabilities of i. Then H(pi) is 1 bit but S(p) is 0.601 qubits. Moreover S(p) 
decreases monotonically from 1 to as the overlap |(^i|^>2)| 2 is increased from to 1. 

The interpretation of non-orthogonality is one of the enigmas of quantum theory. Con- 
ventionally the overlap KValV^)) 2 provides a measure of the non-distinguishability of \ip\) 
and ^2) and this is reflected in the properties of the von Neumann entropy above viz. an in- 
creasing redundancy of quantum information with increasing overlap. Thus if £\ = {\cti) ; — } 
and £2 = i ^} are two quantum sources with n states each (having all prior proba- 
bilities equal, for simplicity) and with larger overlaps \(Pi\(3j)\ 2 > \(ai\a.j}\ 2 for each pair 
in £2 compared to the corresponding pair in £1, then we would expect a decrease of infor- 
mation content in passing from £\ to £2- While this is true for n = 2 it can be shown to 
fail genericailv|13| for n = 3 and higher i.e. it is generically possible to increase the von 
Neumann entropy of a source while increasing the overlaps of every pair of signal states! 
Evidently the quantum information content depends on more subtle structural properties 
of the geometry of the signals in the Hilbert space, beyond just the pairwise overlaps. 

The relationship of quantum information to the geometry of the Hilbert space is largely 
unstudied but for n = 3 we can say a little more[T3*]. The von Neumann entropy S of three 
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equiprobable states j^i), \ip2)} I ^3) is a function of four independent real parameters: the 
three overlaps an = \(ipi\ip2)\ 2 , 023 = KV^I^)! 2 , a 3 i = |(V'3|V ! l)| 2 an d the phase £ of the 
triple product T = (V'i|V , 2)(V ; 2|V ; 3)(V ; 3|V ; i) (noting that the squared modulus of T is the 
dependent quantity 012023031 )• Then keeping 012, 023 and 031 fixed we can vary £ and it 
can be shown [T2] that S is actually a monotonically decreasing function of cos £ (or lZe(T)). 
Despite this clean relationship, we still lack an intuitive understanding of why increasing 
the phase £ allows increased compressibility. 

In this paper we have promoted a viewpoint that attempts to place a notion of informa- 
tion at a primary fundamental level in the formulation of quantum physics. In the spirit of 
Landauer's slogan "Information is physical!" we would declare "Physics is informational!" 
Physical theories have traditionally been formulated in conceptual and mathematical terms 
that are, at root, geometrical. As such, they have an intuitive accessibility which has facil- 
itated many developments (for example, the powerful guiding principles of symmetry and 
co-ordinate invariance in the construction of Lagrangians and field equations). Similarly 
the concept of information has an intuitive basis although not geometrical (and evidently 
having a complicated relation to the geometry of state space) . Hence it offers a potentially 
new perspective on quantum physics with its own characteristic flavour of guiding princi- 
ples. For example we might adopt the principle that any prospective physical theory should 
not allow the efficient solution of an NP complete problem. This principle greatly restricts 
the form of the theory yet remarkably, it appears to hold in the established formalisms of 
classical and quantum physics, which developed from entirely different perspectives. Al- 
though an informational and geometrical formulation of a given physical theory would be 
mathematically equivalent, both points of view are valuable for further developments: nat- 
ural generalisations that these respective formalisms suggest would be quite different and 
no longer equivalent as theories. 
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